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Abstract 



In this paper, we analyze the coding delay and the average coding delay of Chunked network Codes (CC) over line 
networks with Bernoulli losses and deterministic regular or Poisson transmissions. Chunked codes are an attractive 
alternative to random linear network codes due to their lower complexity. Our results, which include upper bounds on 
t*^ ' the delay and the average delay, are the first of their kind for CC over networks with such probabilistic traffics. These 

results demonstrate that a stand-alone CC or a precoded CC provides a better tradeoff between the computational 
complexity and the convergence speed to the network capacity over the probabilistic traffics compared to arbitrary 
jy! , deterministic traffics. The performance of CC over the latter traffics has already been studied in the literature. 

I. Introduction 
K^ ' Chunked codes (CC), originally proposed in [1], generalize random linear network codes (dense codes), and 

m 

Tlj" ' operate by dividing the message of the source into non-overlapping or overlapping sub-messages of equal size, 



called chunks [l]-[3]. Each node at each transmission time randomly chooses a chunk, and transmits it by using 



CO . a dense code. In fact, a dense code is a CC with only one chunk of the size equal to the message size. Thus, CC 

o 

p^ ■ require less complex coding operations due to applying coding on chunks smaller than the original message. This 

however comes at the cost of lower speed of convergence to the capacity compared to dense codes. 

The speed of convergence of CC to the capacity of line networks with arbitrary deterministic traffics was studied 

X ■ 
^ , in [1], [3]. In particular, it has been shown that (i) a CC achieves the capacity, so long as the size of the chunks 

a ■ 

- - - is lower bounded by a function super-logarithmic in the message size and super-log-cubic in the network length, 

and (ii) a CC, preceded by a capacity-achieving erasure code, approaches the capacity with an arbitrarily small but 
non-zero constant gap, so long as the size of the chunks is lower bounded by a function constant in the message 
size and log-cubic in the network length. There is however no result on the speed of convergence of CC to the 
capacity over the networks with probabilistic traffics. 

The speed of convergence of dense codes to the capacity of some probabilistic traffics was studied in [4], [5]. 
Very recently, in [6], we studied the coding delay and the average coding delay of a dense code over the traffics 

TThis paper is an extended version of an accompanying work submitted to NetCod 2012. 



with deterministic regular or Poisson transmissions and Bernoulli losses.' The results were in some cases more 
general, and in some other cases tighter, than the existing bounds in [4], [5]. 

In this paper, we generalize our analysis in [6], and for the first time, study the coding delay and the average 
coding delay of CC for different ranges of the chunk sizes. ^ 

The main contributions of this work are: 

• We derive upper bounds on the coding delay and the average coding delay of a CC alone, or a CC with 
precoding, over the traffics with deterministic regular transmissions or Poisson transmissions and Bernoulli 
losses with arbitrary parameters or unequal parameters. 

• We show that: (i) a CC achieves the capacity, so long as the size of the chunks is bounded from below 
by a function super-logarithmic in the message size and super-log-linear in the network length, and (ii) the 
combination of a CC and a capacity-achieving erasure code approaches the capacity with an arbitrarily small 
non-zero constant gap, so long as the size of the chunks is bounded from below by a function constant in the 
message size and log-linear in the network length. The lower bounds in both cases are smaller than those over 
the networks with arbitrary deterministic traffics. Thus both coding schemes are less computationally complex 
(require smaller chunks), for the same speed of convergence, over such probabilistic traffics, compared to 
arbitrary deterministic traffics. 

• In a capacity-achieving scenario, for such probabilistic traffics, we show that: (i) the upper bound on the 
overhead^ grows sub-log-linearly with the message size and the network length, and decays sub-linearly with 
the size of the chunks, and (ii) the upper bound on the average overhead grows sub-log-linearly (or poly- 
log-linearly) with the message size, and sub-log-linearly (or log-linearly) with the network length, and decays 
sub-linearly (or linearly) with the size of the chunks, in the case with arbitrary (or unequal) parameters. For 
arbitrary deterministic traffics, the upper bound on the overhead was shown in [3] to be similar to (i), but with 
a larger (super- linear) growth rate with the network length. 

II. Network Model and Problem Setup 

A. Transmission and Loss Model 

We consider a unicast problem (one-source one-sink) over a line network with L links connecting L + 1 nodes 
{vi}o<i<L in tandem. The source node vq has a message of k vectors (called message vectors) from a vector space 
T over F2, and the sink node vl requires all the message vectors."* 

'The coding delay of a code over a network with a given traffic (schedule of transmissions and losses) is the minimum time that the code 
takes to transmit all the message vectors from the source to the sink. The coding delay is a random variable due to the randomness in both the 
code and the traffic. The average coding delay of a code with respect to a class of traffics is the coding delay of the code averaged out over 
all the traffics (but not the codes), and hence is a random variable due to the randomness in the code. 

^In this paper, we focus on CC with non-overlapping chunks. The analysis of CC with overlapping chunks is the focus of an ongoing research 
project. 

^The (average) overhead is the difference between the (average) coding delay and the ratio of the message size to the capacity. 

'*The analysis in this paper is generalizable to finite fields of larger size. 
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Each (non-sink) node at each transmission time transmits a (coded) packet, which is a vector in T . The packet 
transmissions are assumed to occur in discrete-time, and the transmission times over different links are assumed to 
follow independent stochastic processes. The transmission times over the i* link are specified by (i) a deterministic 
process where there is a packet transmission at each time instant, or (ii) a Poisson process with parameter A^ : < 
\i < 1, where A^ is the average number of transmissions per time unit over the i* link. The transmission schedules 
resulting from (i) and (ii) are referred to as deterministic regular and Poisson, respectively. 

Each transmitted packet either succeeds or fails to be received [successful vs. lost). The successful packets are 
assumed to arrive with zero delay, and the lost packets will never arrive. The packets are assumed to be successful 
independently over different links. The successful packets over the i* link are specified by a Bernoulli process with 
(success) parameter Pi : < Pi < 1, where pi is the average number of successes per transmission over the i* link. 
The loss model defined as above is referred to as Bernoulli. The special case of Bernoulli loss with all pi's equal 
to 1 is analogous to the lossless case. 

B. Problem Setup 

The goal in this paper is to derive upper bounds on the coding delay and the average coding delay of chunked 
codes over line networks with deterministic regular or Poisson transmissions and Bernoulli losses. 

In a chunked coding scheme, the set of k message vectors at the source node is divided into q disjoint subsets, 
called chunks, each of size a = k/q. The source node, at each transmission time, chooses a chunk independently at 
random, and transmits a packet by randomly linearly combining the message vectors belonging to the underlying 
chunk. Each non-source non-sink node, at the time of each transmission, chooses a chunk independently at random, 
and transmits a packet by randomly linearly combining its previously received packets pertaining to the underlying 
chunk. The global encoding vector^ of each packet is assumed to be transmitted along with the packet. The sink 
node can decode a chunk, so long as it receives an innovative^ collection of packets pertaining to the underlying 
chunk of a size equal to the size of the chunk. 

III. Deterministic Regular Transmissions and Bernoulli Losses 

We first review the analysis of dense codes, which are a special case of CC with one chunk, in two cases of 
arbitrary or unequal (success) parameters, presented in [6].^ Next, we generalize the analysis to CC with more than 
one chunk. 

'The global encoding vector of a packet is the vector of the coefficients representing the mapping between the message vectors and the 
packet. 

*A collection of packets is innovative if the global encoding vectors of the packets belonging to the collection are linearly independent. 

'The details of the proofs in the case of arbitrary parameters were given in [6] and hence omitted. However, neither the details, nor the 
sketches of the proofs in the case of unequal parameters were given in [6] . We present the sketches of the proofs in this paper for the purpose 
of completeness. 
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A. Dense Codes 

The goal of the analysis is to lower bound (i) the size of a maximal dense collection of packets at the sink node 
until a certain time,^ and then, (ii) the probability that a sufficient number of packets in the underlying collection 
are innovative. 

Let Qi+i and Qi be the decoding matrices^ at the (i + 1)* and i* nodes, respectively, and Ti be a matrix over 
F2 such that Qi+i = TiQi. The entries of Qi+i and Qi are in F2. Each row of Ti is the local encoding vectors of 
a successful packet sent by the i* node. Let Q[ be Qi restricted to its rows corresponding to the global encoding 
vectors of the dense packets at the i* node. Let T/, the transfer matrix at the i* node, be a matrix over F2 such 
that Qi+i = TlQ[. Each row of T/ indicates the labels of the dense packets at the i* node which contribute to a 
successful packet sent by the z* node. 

For every matrix Q over F2, the density of Q, denoted by V{Q), is the size of a maximal dense collection of 
rows in Q, where a collection of rows is dense if the rows have all independent and uniformly distributed Bernoulli 
entries. Further, Q is called a dense matrix if all its rows form a dense collection. For every matrix T over F2, the 
rank of T, denoted by rank(r), is the size of a maximal collection of linearly independent rows in T. 

Lemma 1: Let Q be a dense matrix over F2, and T be a matrix over F2, where the number of rows in Q and 
the number of columns in T are equal. If rank(r) > 7, then V{TQ) > 7. 

Since Qi+i ~ T-Qi, and Q'^ is dense, 'D{Qi^i) is lower bounded so long as rank(r/) is lower bounded. As 
shown in [6], the matrix T/ includes a sub-matrix with the structure of a random block lower-triangular matrix, 
and the rank of a matrix with such a structure is lower bounded as follows. 

Let w, r and {'rj}i<j<w be arbitrary non-negative integers, and let rmax ~ maxj r^ and r^in = min^y-j. Let 
Tjj- be an r X rj dense matrix over F2, if 1 < j < i < w; or an arbitrary r x rj matrix over F2, otherwise. Let 
T = \Tij]i<ij<w The matrix T is called random block lower-triangular (RBLT). 

Lemma 2: Let T be an RBLT matrix with parameters w, r and {rj : < tj < r}i<j<^,. Let u = \{n — 7)/?'niinl, 
and n = J2i<j<w ^3- For every integer < 7 < n-1, Pr{r(T) < 71-7} < u (1 - 2-'^"™) 2-''+"-'"'~+('-'-"™)(«-i). 

Lemma 3: Let T be an RBLT matrix with parameters w, r and {rj : < r < rj}i<j<.ui- Let u = \{n — j)/r~\, 
and n =^ wr. For every integer < 7 < n - 1, Pr{r(r) < n - 7} < u (1 - 2-'') 2-'i'+"-«"'"™+('-"™-'-)("-i). 

The application of the lemmas is subject to useful and tight choices of w, r, and r^'s. Such parameters depend 
on the traffic over the i* and {i + 1)* links, and hence not straightforward to optimize. However, by using a 
probabilistic technique, tight bounds on such parameters can be derived. 

Let {0,Nt] be the period of time over which the transmissions occur Let {0,Nt] be divided into w disjoint 
partitions of length Nt/w. For every I < j < w, and 1 < i < L, let lij be the j* partition pertaining to the i* 
link. For every i,j'.i<j<w — L + i, lij is called active. Let wt == L{w — L + 1) be the total number of active 

*A collection of packets is dense if tlie local encoding vectors of the packets are linearly independent, where the local encoding vector of a 
packet is the vector of the coefficients of the linear combination pertaining to the packet. 

'The global encoding vectors of the packets at a node form the rows of the decoding matrix at the node. 
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partitions. 

Let (pij be the number of successful packets in /y . By the assumption, ipij is a binomial random variable with 
the expected value ipi = piNr/w. Let p = mini<i<iPi, and ip = pMr/w. For any real number x, let i denote 
x/2. By applying the Chemoff bound, one can show that ipij is not larger than or equal to ?'=(!— 'y*)ip with 
probability (w.p.) bounded above by (b.a.b.) e/wr, so long as < 7* < 1, where 7* ^ \J{l/ip) hiiwr/f)- For all 
i, j, suppose that ipij is larger than or equal to r. 

Let V{Ql) be the number of dense packets in the first j active partitions over the i* link. 

The packets over the first link are all dense. Thus, for all j, 'D{Q{) > rj. For any other values of i, j, by applying 

Lemma 2, it can be shown that the inequality 'D{Ql) > rj — j(l + o(l)) log(wT/e) fails w.p. b.a.b. ije/wr, so 

long as 

w\og — o{pNt)- (1) 

e 

This result shows that the number of dense packets at the sink node, ^{Ql), fails to be larger than 

pNr - 0{pNtL/w)- 



0{^pNTw\og{wL/e)) - 0{w \og{wL/e)), (2) 

w.p. b.a.b. e. By condition (1), it follows that each 0{.) term in (2) is o{pNt) which ensures that the code achieves 



the capacity. We specify w by y/ pNxL'^ l^og{pNTL / e) in order to maximize (2) subject to condition (1). 

Let riT be equal to (2). Thus, Ql fails to include an nr x k dense sub-matrix w.p. b.a.b. e. 

Lemma 4: Let Q be an 71 x fc (fc < n) dense matrix over F2. Then, Pr{rank(Q) < fc} < 2^("^'^). 

By Lemma 4, Pr{rank(QL) < k] is b.a.b. e, so long as fc < n^ — log(l/e). By replacing e with e, it follows 
that the sink node can recover all the message vectors w.p. b.a.b. e, so long as fc < tit — log(l/e) ^ 1- Let fc^ax 
be the largest integer fc satisfying this inequality. Thus, fcmax ^ pNx, and by replacing Nt with k/p, the following 
result is immediate. 

Theorem 1: The coding delay of a dense code over a line network of L links with deterministic regular traffics 
and Bernoulli losses with parameters [pi] is larger than 



- fc + (1 + 0(1)) h Wfc [tulog— j +wlog— J j 

w.p. b.a.b. e, where w ^ (fcL^/log(fcL/e)) ^, p = nimi<i<L pi. 

In the case of the average coding delay, the analysis proceeds by replacing r with ip in the preceding results, 
and re-specifying w by ^^pNxL /\og(j)NTL / e) in order to maximize 

pNt - 0{pNtL/w) - 0{w \og{wL/e)), (3) 

instead of (2), subject to condition (1). 

Theorem 2: The average coding delay of a dense code over a network similar to Theorem 1 is larger than 

1 A . , .. fkL , wL 

- fc+ 1 + 1)) hwlog 

p \ \ w e 
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w.p. b.a.b. e, where w ^ {kL /\og{kL / e))^ . 

In order to derive tighter bounds the actual values of the success parameters {pi} need to be taken into 
consideration. In particular, the coding delay and the average coding delay of dense codes for a special case, 
where no two links have equal success parameters, are upper bounded as follows. 

Let us assume pi > p2 > ■ ■ ■ > pl, without loss of generality. Let p = mini<j<iPi, 7e = mini<j<i7e;, and 
lei === \Pi -Pi-i\- Let n^ {I- i*)(pi, where ip^ = PiNr/w and 7* ~ y^{l/ipi)\og{wT/e). Let ipij be defined as 
before. For all i,j, suppose that ipij is larger than or equal to r^. 

Similarly as before, for all j, 'D{Q-[) > rij. For any other values of i,j, by applying Lemma 3, it can be shown 
that the inequality 'D{Ql) > rij fails w.p. b.a.b. ije/wT, so long as 

u; log = oijepNr)- (4) 

e 

Let p, tf, 7* and r denote pL, <fL, 72 and tl, respectively. Thus, the inequality 'V{Ql) > (1 ^ ^*)ifWT/L fails 

w.p. b.a.b. e. By replacing (^ with pNr/w, the right-hand side of the last inequality can be written as: 



pNr - 0{pNtL/w) - 0{\/pNTwlog{wL/e)). (5) 

The rest of the analysis is similar to that of Theorem 1, except that (5) excludes the last term in (2), and the choice 
of w needs to satisfy condition (4), instead of condition (1). 

Theorem 3: Consider a sequence of unequal parameters {pi}i<i<L- The coding delay of a dense code over a 
Une network of L Unks with deterministic regular traffics and Bernoulli losses with parameters {pi} is larger than 



Mfc+(i+o(i))(^ + ALiog^ 

W.p. b.a.b. e, where w -- 7e {kL'^ /\og{kL/e))\ p = mmi<i<LPi, 7e = mini<i<L 7e,, and 7e, = \pi - Pt-i\. 

In the case of the average coding delay, the analysis follows the same line as that of Theorem 2, except that the 
choice of w needs to maximize 

pNt - 0{pNtL/w) (6) 

subject to condition (4), instead of (3) subject to condition (1). 

Theorem 4: The average coding delay of a dense code over a network similar to Theorem 3 is larger than 

w.p. b.a.b. e, where w ^ 7efc/(/(fc) log(fcL/e)), and /(fc) goes to infinity, as k goes to infinity, such that /(fc) = 
o(7efc/log(fci/e)). 

B. CC: Capacity-Achieving 

In a CC, at each transmission time, a chunk is chosen w.p. 1/q, and a packet transmission over the i* link is 
successful w.p. Pi. Thus the probability that a given packet transmission over the i* link is successful and pertains 
to a given chunk is pi/q. Thus by replacing pi with Pi/q in the analysis of dense codes in Section III-A, the coding 
delay and the average coding delay of CC in a capacity-achieving scenario will be upper bounded as follows. 
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The results of dense codes are indeed a special case of those of CC with one chunk of size k. It is, however, 
worth noting that, due to the change in the parameters, the number of partitions w needs to satisfy a new condition: 
wqlog^^^^^^ = o{pNt) or wqlog^^^^^ = o{'^epNT), instead of condition (1) or (4), in the proofs of Theorems 5 
and 6, or those of Theorems 7 and 8, respectively. Further by replacing w with its optimal choice in the new version 
of (2), (3), (5) and (6), each 0{.) term needs to be o{pNt /q) in order to ensure that CC are capacity-achieving 
in the underlying case. Such a condition lower bounds the size of the chunks a by a function super-logarithmic in 
the message size k. 

Theorem 5: The coding delay of a CC with q chunks over a line network of L links with deterministic regular 
traffics and Bernoulli losses with parameters {pi} is larger than 



1 

w.p. b.a.b. e, so long as q = o{k/{L\og{kL/e))), where w ^ (fci^/(glog(fcL/e))) '', and p = mini<i<iPi. 

Proof: The proof follows the same line as in that of Theorem 1 by implementing the following modifications. 
Let us replace p and e with p/q and e/q, respectively. Then, ip = pNx/wq, and r = (1 — ^*)f, where 7* ^ 



v/(l/(^)ln(M;T<?/e). Fix a chunk lu. For all 1 < i < L, and 1 < j < w - L + 1, let P((3-), Vp{Ql), and nj 
be defined as before, but only restricted to the packets pertaining to the chunk uj. Similarly as before, for all i,j, 
V^Ql) can be lower bounded as follows: for all l<j<w — L + 1, 'D{Ql) > rj, and for all other values of i,j, 
'D{Ql) fails to be larger than rj — j(l -I- oil)) \og{wTq/(-), w.p. b.a.b. ije/wrq, so long as 

wqlog = o{pNt)- (7) 

e 

Thus the number of dense packets pertaining to the chunk lj at the sink node fails to be larger than 

pNt „ f pNtL 



q \ wq 



^ /p^^^^^\^|^^^^^u^^|^ (8) 



w.p. b.a.b. e/q. In order to maximize (8) subject to condition (7), we specify w by 



pNtL^ 



y q \og{pNTL/e) ' 
Now let us assume that Nx is (1 + o{l))k/p. By replacing e with e, in the preceding results, and by replacing k 
and e with k/q and e/q, respectively, in Lemma 4, it follows that the sink node fails to decode the chunk uj w.p. 
b.a.b. e/q, so long as Nt is larger than 



- lk + {l + o{l))l'^ + Jk(wq\og^j +^q\og^]] . (9) 

Taking a union bound over all the chunks, it follows that the sink node fails to decode all the chunks w.p. b.a.b. 
e, so long as Nt is larger than (9). To ensure that the lower bound on Nt is (1 + o{l))k/p, all the terms in (9), 
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excluding the first one, need to be o{k/p). This condition is met so long as q is 

k 



L\og{kL/e) 



D 



Theorem 6: The average coding delay of a CC with q chunks over a network similar to Theorem 5 is larger than 

- (fc+(l + o(l)) (—+wq\og^^ 
p \ \ w e 

w.p. b.a.b. e, so long as q ~ o{k/{L\og{kL/e))), where w ^ {kL / {q\og{kL / e)))^ . 

Proof: The proof is similar to that of Theorem 5, except that r needs to be replaced with (f. This implies that 
the third term in (8) disappears. Thus by specifying w with 



pNtL 



y qlogipNrL/e) 
in order to maximize (8), excluding the third term, subject to condition (7), it follows that the sink node fails to 
decode all the chunks w.p. b.a.b. e, so long as Nt is larger than 

i(fc + (l + o(l))(^ + -.log^)). (10) 

The rest of the proof follows that of Theorem 5. ■ 

In the case of unequal success parameters, the coding delay and the average coding delay are upper bounded as 

follows. 

Theorem 7: The coding delay of a CC with q chunks over a line network of L links with deterministic regular 

traffics and Bernoulli losses with unequal parameters {pi} is larger than 



^ ' fc + (1 + o(l)) l — + \k (ivqhg ""^^ 



P \ \ ""^ V V ^ 

w.p. b.a.b. e, so long as g = o (7gfc/(Llog(fcL/e))), where w ^ ^e {kL'^/{q^og{kL/e)))^ , p == mini<i<LPi, 

7e = miiii<i<L7e;, and 7e, = \pi - Pi-i\. 

Proof: Fix a chunk uj. By replacing p and e with p/q and e/q, respectively, in the proof of Theorem 3, it 

follows that the number of dense packets pertaining to the chunk uj at the sink node fails to be larger than 

pNt p r pNrL 
q \ wq 



w.p. b.a.b. e/q, so long as 



oL^iog^ , (11) 



wqlog = o{jepNT)- (12) 



The rest of the proof is similar to that of Theorem 5, except that (11) excludes the last term in (8), and the choice 
of w needs to satisfy condition (12), instead of condition (7). By specifying w with 



3/ -iIpNtL^ 
\] q\ogipNTL/e) 
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in order to maximize (11) subject to condition (12), it follows that the sink node fails to decode all the chunks w.p. 
b.a.b. e, so long as Nt is larger than 



i[fc + (l + o(l))[^ + A(^«,log^H|- (13) 



In (13), each term, except the largest one, needs to be o{k/p), and this condition is met so long as q is 

ilk 
L\og{kL/e) 



D 



Theorem 8: The average coding delay of a CC with q chunks over a network similar to Theorem 7 is larger than 

l + ol — 



- A:+(l + o(l)), 
P \ \ w 

w.p. b.a.b. e, so long as q ~ o{jek/{f{k)L\og{kL/e))), where w ^ ^ek/{qf{k)\og{kL/e)), and f{k) goes to 
infinity, as k goes to infinity, such that f{k) = o(7efc/(log(/cL/e))). 

Proof: The proof follows the same line as that of Theorem 5, except that the choice of w needs to maximize 

pNr _ ^ (p_NrL\ ^^^^ 



q \ wq 

subject to condition (12). To do so, we specify w by 

lepNr 



qf{pNT)\og{pNTL/e)' 
where f{n) goes to infinity, as n goes to infinity, such that f{n) = o(7en/(log(nL/e))). The sink node fails to 
decode all the chunks w.p. b.a.b. e, so long as Nt is larger than 

i(fc + (l + o(l))(^)). (15) 

The second term in (15) needs to be o(k/p), and this condition is met so long as q is 

7efc 



f{k)L\og{kL/e) 



D 



C. CC with Precoding: Capacity-Approaching with A Gap 

By the results of Section 111-B, one can conclude that CC are not capacity-achieving if the size of the chunks 
does not comply with condition a = aj(Llog(fcL/e)).'" The analysis of Section 111-A further does not apply to CC 
with chunks of small sizes violating the above condition. From a computational complexity perspective, CC with 
chunks of smaller sizes are, however, of more practical interest (e.g., linear-time CC with constant-size chunks). In 
the following, we study CC with chunks of a size constant in the message size. 

Let {pi}i<i<L be an arbitrary sequence of success parameters, and let p = mini<i<Lpj. Let the size of the chunks 
a {— k/q) be a constant in the message size k, i.e., a = 0(1). Fix a chunk, and focus on the packets pertaining 

'"For non-negative functions /(n) and g{n), we write /(n) = cu{g{n)), if and only if lim,i->oo f{'n-)/g{'n-) = oo- 
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to that chunk. Let the time interval {0,Nt] and its w disjoint partitions be defined as before in Section III-A. Let 
ipij be the number of packets (pertaining to the given chunk) in the partition /^ , and ipi be the expected value of 
ipij. Let ip = mini<i<i ipi. Then, ipi = piNx/wq, and ip = pNt/wq. Let Nt = (1 + 'jc)k/p, where < 7c < 1 
is an arbitrarily small constant. By replacing Nt with (1 + 7c)fc/p, ip ^ {1 + 7c)a/w, and cp ~ 0(1), as u; is a 
constant (otherwise, ip goes to 0, as Nt goes to infinity). 

By applying the Chernoff bound, it can be shown that PT{ipij < (1 — 7*)(/3} < e^^ '^, for every < 7* < 1. 
Taking e~^ ^ < ib/wT, it follows that ipij is not larger than or equal to r = (1 — 7*)<p w.p. b.a.b. 7^/10^, where 



7* is the smallest real number satisfying 7* > ^^{l/ip) \n{wT/'ib), such that r is an integer (7* = 0(1)). Taking a 
union bound over all the active partitions of all links, it follows that (pij is not larger than or equal to r w.p. b.a.b. 

ib- 

Let 'D{Ql) be the number of dense packets pertaining to the given chunk in the first j active partitions over the 
i* Unk. 

By applying Lemma 3, it can be shown that: (i) for all l<i<w — L + 1, T>{Q\) > rj, (ii) for all 1 < i < L, 
the inequality V{Q\) > r — log{wT/ib) fails w.p. b.a.b. ijb/wT, and (iii) for all the other i,j, the inequality 
'DiQi) > r-jlog{wT/ib) - log((j + l)wT/ib) fails w.p. b.a.b. ijib/wT, so long as 

a = n(w^\og—\. (16) 

By using the above results, it follows that the number of dense packets pertaining to the given chunk at the sink 
node fails to be lower bounded by 

"^^-of^.W^Vof^log^) (17) 



L \ L \ jb J \ L 76 

w.p. b.a.b. 7f,. The lower bound is non-negative so long as a = O (w \og{wT/^b)), and this condition holds so long 
as condition (16) holds. We specify w by \/aL'^/log{aL/jb) to maximize (17). By replacing w in (16), it can be 
rewritten as 

a = n(L^\og—]. (18) 

By replacing 7^ with jb, and by applying Lemma 4, it follows that the sink node fails to decode the given chunk 
w.p. b.a.b. 76, so long as (17) is larger than a + log(l/7h). By replacing our choice of w in (17), it can be seen 
that, excluding the first term, the second term dominates the rest. By replacing p with (1 + jc)o'./w, and by using 
the properties of the notation 0(.), the decoding condition becomes 

a = n(^\og—). (19) 

Thus, the given chunk is undecodable w.p. b.a.b. -fb, so long as both conditions (18) and (19) are met. In other 
words, the expected fraction of undecodable chunks is bounded from above by 7b. By using a martingale argument 
similar to the one in [3], the concentration of the fraction of undecodable chunks around the expectation can be 
shown as follows. 
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Lemma 5: By applying a CC with chunks of size a, satisfying both conditions (18) and (19), the fraction of 
undecodable chunks at the sink node until time Nt = (1 + ^c)k/p is larger than (1 + 7a)7b, w.p. b.a.b. e, so long 
as 

o^Vlhl - o(fc/log(l/e)), (20) 

where < 7a,7fc,7c < 1 are arbitrary constants. 

By the result of Lemma 5, the fraction of chunks which are not decodable until time Nt becomes larger than 
(l+7a)7b, w.p. b.a.b. e. Since 7a, 76 are non-zero constants, a CC, alone, does not decode all the chunks. However, 
the completion of decoding of all the chunks is guaranteed by devising a proper precoding scheme [3] . The precoding 
works as follows: The set of k message vectors at the source node constitute the input of a capacity-achieving (c.-a.) 
erasure code, called precode. The rate of the precode is 1 — (1 + 7a)76 (i.e., the precede decoder can correct up to a 
fraction (l+7a)7fc of erasures), and the number of the coded packets at the output of the precode, called intermediate 
packets, is (l + (1 + 7a)7fc + 0(7^)) k. By applying a CC with chunks of size a, satisfying conditions (18), (19) 
and (20), the fraction of the intermediate packets that are not recoverable at the output of the CC decoder until 
time (1 + 7c) (l + (1 + 7a)7h + 0(7^)) - is larger than (1 + 7a)76, w.p. b.a.b. e. Then, the precode decoder can 
recover all the k message vectors from the set of recovered intermediate packets. Therefore, the coding delay of a 
CC with precoding (CCP) is upper bounded as follows. 

Theorem 9: The coding delay of a CCP with chunks of size a and a c.-a. erasure code of rate 1 — 7^, over a 
line network of L links with deterministic regular traffics and Bernoulli losses with parameters {pi} is larger than 
(1 + 7c) (1 + (1 + 7a)76 + 0(7^)) |, W.p. b.a.b. e, so long as 

and o? I^l^l = o(fc/log(l/e)), where < ^a^lbilc < 1 are arbitrary constants, and p = mmi<i<L pi. 

In the case of the average coding delay of a CC with precoding, the following can be shown similar to Theorem 9 
by replacing r with ip, and hence the proof is omitted. 

Theorem 10: The average coding delay of a CCP with chunks of size a and a c.-a. erasure code of rate 1 — 7a, 
over a network similar to Theorem 9 is larger than (1 + 7c) (l + (1 + 7a)7h + 0(7^)) -, w.p. b.a.b. e, so long as 

/ L L 

a — il [ — log ■ 



,7c 7h7c 
and a'^/'Jalb — o(fc/log(l/e)), where < ja,'yb,'^c < 1 are arbitrary constants. 

In the special case of unequal success parameters, the coding delay and the average coding delay of CC with 
precoding are upper bounded as follows. The proofs follow the same line as in the general case except that a new 
set of conditions needs to be satisfied based on the assumption that no two success parameters are equal. 

Theorem 11: The coding delay of a CCP with chunks of size a and a c.-a. erasure code of rate 1 — 7a, over 
a line network of L links with deterministic regular traffics and Bernoulli losses with unequal parameters {pi} is 
larger than (1 + 7c) (l + (1 + 7a)7;, + 0(7^)) |, w.p. b.a.b. e, so long as 



7c 7fc7c/ ' \7f 7e76 
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TABLE I 
Comparison of Chunked Codes over Line Networks with Various Traffics 



Traffic 


Success 
Parameters 


Overhead (rj) 

and 

Average Overhead (fj) 


Size of Chunks 

(a) 


w 


Comments 


Arbitrary 
Determitustic 


- 


^ = fj = o(^feL(i log M)^j 


.;(L3log*f) 


- 


7c, = \Pi -Pi-i 

7e = mini<i<i7e. 

p = mini<i<iPi 


Deterministic 

Regular 

Transmissions 

and 

Bernoulli 

Losses 


Arbitrary 


») = i ((1 + o(l)) (M. + feSmH m)) 


c.(LlogM.) 


( -^'' \^ 


W^l 


f5=l((l+o(l))(M+„,)) 


( "^ ^" 


V i°g ^ ; 


Unequal 


,,= i((l + o(l))(M + fc^™i)) 


-(^log^) 




^=i((l + o(l))(M)) 


-(/(^)(^logM)) 


1 / 7ca \ 

f(k) Viogif ; 



TABLE II 

Comparison of Chunked Codes with Precoding (A Capacity- Achieving Erasure Code) over Line Networks with Various 

Traffics 



Traffic 


Success 
Parameters 


Overhead (tj) 

and 

Average Overhead (fj) 


Size of Chunks 
(a) 


Comments 


Arbitrary 
DeteiTninistic 


- 


v = fi = 'yok 


^(f'°^^) 




< 7a, 76: 7c < 1 
{7a,76,7c} = 0(l) 

7o = 7c + (1 + 7c)7o 

7o = {l+7a)76 + 0(762) 

7c, = \Pi -Pi~i\ 

7e = mini<i<i -/e^ 

p = mini<i<_LPi 


Deterministic 

Regular 

Transmissions 

and 

Bernoulli 

Losses 


Arbitrary 


ri = lol 


^({(t>°^^)'(^^i°^i)}) 


<m) 


n = iol 


^(ti°s^) 


Unequal 


ri = lol 


^({(4'°^^)'(^'°^^)}) 


n = io\ 


nf^log ^ ) 

V7|7c '^ Iblc J 



and a^/jalb ^ o(fc/log(l/e)), where < ^a,lbilc < 1 are arbitrary constants, p = mini<i<ip,;, 7e = 
niini<i<L7ei, and 7e; ^ \pi ~ Pi-i\- 

Proof: Let us assume pi > p2 > ■■■ > PL, without loss of generaUty. Let p = niini<i<Lpi, 7,; = 
mini<i<L7ei, and 7e, = |pi - Pi-i\. Fix a chunk. Let r^ = (1 - 7*)(p,, where (^^ = piNr/wq and 7* - 
y^{l/ipi) \og{wT/ib), and < 7b < 1 is an arbitrary constant. Let ipij be the number of packets (pertaining to the 
given chunk) in the partition /^ (the j* partition pertaining to the i* Hnk), where the time interval {0,Nt] is split 
into w partitions of length Nt/w, and let ipi be the expected value of ipij. For all i,j, suppose that ipij is larger 
than or equal to r.;. Let Nt = (1 + jc)k/p, where < 7c < 1 is an arbitrarily small constant. By replacing Nt 
with (1 + 7c)fc/p, (fii = (1 + 'yc)pia/pw, and ip = 0(1), similar to that in the proof of Theorem 9. 

Similarly as before, for all j, 'D{Q-[) > rij. For any other values of i,j, by applying Lemma 3, it can be shown 
that the inequality 'V{Q\) > rij fails w.p. b.a.b. ijib/wr, so long as 



ill ^ log 

.7i 76 



(21) 
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Let (p, 7* and r denote ip^, 7^ and tl, respectively. Thus, the number od dense packets pertaining to the given 
chunk at the sink node fails to be larger than 



a{\+-f,)-0[^\-oUaw\og^]. (22) 



We specify w by 



\0g{wTllb), 

to maximize (22) subject to condition (21). For this choice of w, condition (21) is met so long as 

a = Ql ^\og—\ (23) 

By replacing 76 with 76 in the preceding results, and substituting w in (22), the result of Lemma 4 shows that the 
sink node fails to decode the given chunk w.p. b.a.b. 7(„ so long as (22) is larger than a + log(l/7(,). Based on the 
properties of the notation Vl{.), the latter condition is met so long as 

a^^(L\og—\ (24) 

The rest of the proof is similar to the proof of Theorem 9, except that in this case conditions (23) and (24) need 

to be met, instead of conditions (18) and (19). ■ 

Theorem 12: The average coding delay of a CCP with chunks of size a and a c.-a. erasure code of rate 1 — 7a, 

over a network similar to Theorem 11 is larger than (1 + 7c) (l + (1 + 7a)7& + 0(7^)) -, w.p. b.a.b. e, so long as 

a^Q.[ -^r— log 

and o? jj^'-fl = o(fc/log(l/e)), where < "lailbTlc < 1 are arbitrary constants. 

Proof: The proof follows the same line as that of Theorem 11, except that the choice of w needs to maximize 

ail+j,)-o(^] (25) 

subject to condition (21). To do so, the choice of w needs to be n{L/jc), and hence, condition (21) becomes 

a = n(^\og—). a 

\iiic ibicj 



IV. PoissoN Transmissions and Bernoulli Losses 

In the case of Bernoulli losses and Poisson transmissions with parameters {pi]i<i<L and {\i]i<i<L, the points 
in time at which the arrivals/departures occur over the i* link follow a Poisson process with parameter XiPi. 
Thus the number of packets pertaining to a given chunk, in each partition pertaining to the i* link, has a Poisson 
distribution with the expected value XiPiNT/wq. Since the result of Chemoff bound also holds for Poisson random 
variables, the main results in Section III apply to this case by replacing p with Xp, where {X,p} == {A^,p^}, and 
fj. = argmini<i<i AjPj. 
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V. Discussion 

Table I shows the upper bounds" (w.p. of failure b.a.b. e) on the overhead and the average overhead of CC over 
various traffics for different ranges of the size of the chunks based on the results in Section III and those in [3].'~ 
The traffics are: arbitrary deterministic traffics, or traffics with deterministic regular transmissions and Bernoulli 
losses. We refer to the latter traffics as the probabilistic traffics for simplifying the terminology. The probabilistic 
traffics are categorized into two sub-categories: traffics with arbitrary success parameters and traffics with unequal 
success parameters. In the case of arbitrary deterministic traffics, the capacity is 1, and in the case of probabiUstic 
traffics with success parameters {pi}i<i<L, the capacity is p, where p = nimi<i<LPi- We say that a code is 
"capacity-achieving" (c.-a.) if the ratio of the overhead to k/p goes to 0, as k goes to infinity. Similarly, a code is 
"capacity-achieving on average" (c.-a. a.) if the ratio of the average overhead to k/p goes to 0, as k goes to infinity. 
In Table I, the upper (or lower) row in front of each case of success parameters corresponds to a c.-a. (or a c.-a. a.) 
scenario. 

In the table, one can see that, for each traffic, the size of the chunks (a) has to be sufficiently large so that 
CC are c.-a. or c.-a. a.. For arbitrary deterministic traffics, the lower bound on a is super-logarithmic in k, i.e., 
u!{\ogk), and super-log-cubic in L, i.e., uj{L'^ log L). For the probabilistic traffics with arbitrary or unequal success 
parameters, the lower bound on a has a similar growth rate with k, but a smaller (super-log-linear) growth rate 
with L, i.e., u!{L\ogL). The coding cost of CC (i.e., the ratio of the number of the coding (packet) operations to 
k), is, on the other hand, linear in a. Thus, CC can perform as fast over both the arbitrary deterministic traffics and 
the probabilistic traffics, but with a lower coding cost (smaller chunks) in the latter case compared to the former. 

Moreover, as it can be seen in Table I, for both arbitrary deterministic and probabiUstic traffics (in each case 
of arbitrary or unequal success parameters), the overhead grows sub-log-linearly with k, i.e., 0(fclog^ fc), and 
decays sub-linearly with a, i.e., 0(1/ a^). However, for arbitrary deterministic traffics, the overhead grows with 
0(Llog3 L), and for the probabilistic traffics, it only grows with 0{L^ log^ L). This implies a faster speed of 
convergence to the capacity in the latter case compared to the former Similar comparison result can also be 
observed in terms of the average overhead, except that in the case of unequal success parameters, the average 
overhead decays linearly with a, i.e., 0(l/a), but grows poly-log-linearly with fc, i.e., 0(fclog^ fc), for the choice 
of f{k) = 0(7e logfc), and log-linearly with L, i.e., 0{L\ogL). 

Table II also shows the results for CC with precoding (CCP) in the scenarios similar to those considered in 
Table I, where the precode is an (capacity-achieving) erasure code of dimension k and rate 1 — 70. In particular, 
one can see that CCP are "capacity-approaching" or "capacity-approaching on average" with an arbitrary small 
"non-zero constant" gap 70 (i.e., the ratio of the overhead or the average overhead to k/p goes to 70, as k goes 
to infinity) if a is sufficiently large. For simplifying the terminology, we drop the term "with a non-zero constant 

"with a slight abuse of language, we refer to the "upper bound" on the overhead or the average overhead as the "overhead" or the "average 
overhead." 

'^The results of Section III-B and those of Section III-C were stated in terms of q and a, respectively. In this section, for the ease of 
comparison, the former results are also restated in tenns of a by replacing q with k/a. 
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gap." The upper row (or the lower row) in front of each case of success parameters corresponds to a capacity- 
approaching (or a capacity-approaching on average) scenario. For arbitrary deterministic traffics, the lower bound 
on a is constant in k, and log-cubic in L, i.e., 0{L^ \ogL). For the probabilistic traffics with arbitrary or unequal 
success parameters, the lower bound on a is also constant in k, but has a smaller (log-linear) growth rate with 
L, i.e., 0{LlogL). Thus, in the case of CCP, one can make a conclusion similar to the one made in the case of 
stand-alone CC, with respect to the arbitrary deterministic and the probabilistic traffics. 
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